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BACKGROUND OF THE INVENTION 

Development of hybrid plant breeding has made possible considerable advances in 
quality and quantity of crops produced. Increased yield and combination of desirable 
characteristics, such as resistance to disease and insects, heat and drought tolerance, along 
with variations in plant composition are all possible because of hybridization procedures. 
These procedures frequently rely heavily on providing for a male parent contributing 
pollen to a female parent to produce the resulting hybrid. 

Field crops are bred through techniques that take advantage of the plant's method of 
pollination. A plant is self-pollinating if pollen from one flower is transferred to the same 
or another flower of the same plant. A plant is cross-pollinated if the pollen comes from a 
flower on a different plant. 

In Brassica . the plant is normally self sterile and can only be cross-pollinated. In 
self-pollinating species, such as soybeans and cotton, the male and female plants are 
anatomically juxtaposed. During natural pollination, the male reproductive organs of a 
given flower pollinate the female reproductive organs of the same flower. 

Maize plants (Zea mays jL) present a unique situation in that they can be bred by 
both self-pollination and cross-pollination techniques. Maize has male flowers, located on 
the tassel, and female flowers, located on the ear, on the same plant. It can self or cross 
pollinate. Natural pollination occurs in maize when wind blows pollen from the tassels to 
the silks that protrude from the tops of the incipient ears. 

A reliable method of controlling fertility in plants would offer the opportunity for 
improved plant breeding. This is especially true for development of maize hybrids, which 
relies upon some sort of male sterility system and where a female sterility system would 
reduce production costs. 
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The development of maize hybrids requires the development of homozygous 
inbred lines, the crossing of these lines, and the evaluation of the crosses. Pedigree 
breeding and recurrent selection are two of the breeding methods used to develop inbred 
lines from populations. Breeding programs combine desirable traits from two or more 
5 inbred lines or various broad-based sources into breeding pools from which new inbred 
lines are developed by selfing and selection of desired phenotypes. A hybrid maize variety 
is the cross of two such inbred lines, each of which may have one or more desirable 
characteristics lacked by the other or which complement the other. The new inbreds are 
crossed with other inbred lines and the hybrids from these crosses are evaluated to 
1 0 determine which have commercial potential. The hybrid progeny of the first generation is 
u designated Fj. In the development of hybrids only the F] hybrid plants are sought. The 
jJ Fj hybrid is more vigorous than its inbred parents. This hybrid vigor, or heterosis, can be 

manifested in many ways, including increased vegetative growth and increased yield, 
y i Hybrid maize seed can be produced by a male sterility system incorporating 

rjl5 manual detasseling. To produce hybrid seed, the male tassel is removed from the growing 
; , female inbred parent, which can be planted in various alternating row patterns with the 
-ry male inbred parent. Consequently, providing that there is sufficient isolation from sources 
J of foreign maize pollen, the ears of the female inbred will be fertilized only with pollen 
□ from the male inbred. The resulting seed is therefore hybrid (Fl) and will form hybrid 
20 plants. 

Environmental variation in plant development can result in plants tasseling after 
manual detasseling of the female parent is completed. Or, a detasseler might not 
completely remove the tassel of a female inbred plant. In any event, the result is that the 
female plant will successfully shed pollen and some female plants will be self-pollinated. 

25 This will result in seed of the female inbred being harvested along with the hybrid seed 
which is normally produced. Female inbred seed is not as productive as Fl seed. In 
addition, the presence of female inbred seed can represent a germplasm security risk for 
the company producing the hybrid. 

Alternatively, the female inbred can be mechanically detasseled by machine. 

30 Mechanical detasseling is approximately as reliable as hand detasseling, but is faster and 
less costly. However, most detasseling machines produce more damage to the plants than 
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hand detasseling. Thus, no form of detasseling is presently entirely satisfactory, and a 
need continues to exist for alternatives which further reduce production costs and to 
eliminate self-pollination of the female parent in the production of hybrid seed, 

A reliable system of genetic male sterility would provide advantages. The 
laborious detasseling process can be avoided in some genotypes by using cytoplasmic 
male-sterile (CMS) inbreds. In the absence of a fertility restorer gene, plants of a CMS 
inbred are male sterile as a result of factors resulting from the cytoplasmic, as opposed to 
the nuclear, genome. Thus, this characteristic is inherited exclusively through the female 
parent in maize plants, since only the female provides cytoplasm to the fertilized seed. 
CMS plants are fertilized with pollen from another inbred that is not male-sterile. Pollen 
from the second inbred may or may not contribute genes that make the hybrid plants male- 
fertile. Usually seed from detasseled normal maize and CMS produced seed of the same 
hybrid must be blended to insure that adequate pollen loads are available for fertilization 
when the hybrid plants are grown and to insure cytoplasmic diversity. 

There can be other drawbacks to CMS. One is an historically observed association 
of a specific variant of CMS with susceptibility to certain crop diseases. This problem has 
discouraged widespread use of that CMS variant in producing hybrid maize and has had a 
negative impact on the use of CMS in maize in general. 

One type of genetic sterility is disclosed in U.S. Patents 4,654,465 and 4,727,219 to 
Brar, et al. However, this form of genetic male sterility requires maintenance of multiple 
mutant genes at separate locations within the genome and requires a complex marker 
system to track the genes and make use of the system convenient: Patterson also described 
a genie system of chromosomal translocations which can be effective, but which are 
complicated. (See, U.S. Patents No. 3,861,709 and 3,710,51 1.) 

Many other attempts have been made to improve on these drawbacks. For 
example, Fabijanski, et al., developed several methods of causing male sterility in plants 
(see EPO 89/3010153.8 publication no. 329,308 and PCT application PCT/CA90/00037 
published as WO 90/08828). One method includes delivering into the plant a gene 
encoding a cytotoxic substance associated with a male tissue specific promoter . Another 
involves an antisense system in which a gene critical to fertility is identified and an 
antisense to the gene inserted in the plant. Mariani, et al. also shows several cytotoxic 
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antisense systems. See EP 89/401, 194. Still other systems use "repressor" genes which 
inhibit the expression of another gene critical to male sterility. PCT/GB90/00102, 
published as WO 90/08829. 

A still further improvement of this system is one described at U.S. Patent No. 
5 5,478,369 (incorporated herein by reference) in which a method of imparting controllable 
male sterility is achieved by silencing a gene native to the plant that is critical for male 
fertility and replacing the native DNA with the gene critical to male fertility linked to an 
inducible promoter controlling expression of the gene. The plant is thus constitutively 
sterile, becoming fertile only when the promoter is induced and its attached male fertility 
10 gene is expressed. 

As noted, an essential aspect of much of the work underway with male sterility 
g systems is the identification of genes impacting male fertility. 

Jjj Such a gene can be used in a variety of systems to control male fertility including 

H those described herein. Previously, a male fertility gene has been identified in Arabidopsis 
[pi 5 thaliana and used to produce a male sterile plant. Aarts, et al, "Transposon Tagging of a 
^ Male Sterility Gene in Arabidopsis", Nature, 363:715-717 (Jun. 24, 1993). U.S. Patent 
yk No. 5,478,369 discloses therein one such gene impacting male fertility. In the present 
\ * invention the inventors provide novel DNA molecules and the amino acid sequence 
£ encoded that are critical to male fertility in plants. These can be used in any of the systems 
p20 where control of fertility is useful, including those described above. 

Thus, one object of the invention is to provide a nucleic acid sequence, the 
expression of which is critical to male fertility in plants. 

Another object of the invention is to provide a DNA molecule encoding an amino 
acid sequence, the expression of which is critical to male fertility in plants. 
25 Yet another object of the invention is to provide a promoter of such nucleotide 

sequence and its essential sequences. 

A further object of the invention is to provide a method of using such DNA 
molecules to mediate male fertility in plants. 

Further objects of the invention will become apparent in the description and claims that 
30 follow. 
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SUMMARY OF THE INVENTION 

This invention relates to nucleic acid sequences, and, specifically, DNA molecules 
and the amino acid encoded by the DNA molecules, which are critical to male fertility. A 
promoter of the DNA is identified, as well as its essential sequences. It also relates to use 
5 of such DNA molecules to mediate fertility in plants. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1. is a locus map of the male sterility gene SBMu200. 
FIG. 2. is a gel of a Southern Blot analysis of ECORI digested DNA from a 
10 Mu family segregating for male sterility and hybridized with a Mu8 
probe. 

O FIG. 3. is a Northern Blot analysis gel hybridized with a PstI fragment isolated from the 
m SBMu2003.1 clone. 

£ FIG 4 is the sequence of SBMu200 (The cDNA is SEQ ID NO.l, the protein is SEQ ID 
£hl5 NO.2). 

ss 

FIG. 5 is the genomic SBMu200 sequence (also referred to as SEQ ID NO. 7). 

FIG 6 is a comparison of the genomic SBMu200 sequence with the cDNA of SBMu200. 
u FIG. 7. is a Northern analysis gel showing developmental gene expression in 
% microsporogenesis of the gene SBMu200. 
M«20 FIG. 8 is the full length promoter of SBMu200 (SEQ ID No. 5) 

FIG. 9. is a bar graph showing luciferase activity after deletions of select regions of the 

SbMu200 promoter. 

FIG. 10 shows an essential region of the SBMu200 promoter (SEQ ID NO. 6). 
FIG 1 1 is a bar graph showing luciferase activity after substitution by restriction site linker 
25 scanning of select small (9-10bp) regions of the SBMu200 essential promoter fragment. 
FIG. 12 is a comparison of SBMu200 sorghum tassel and SBMu200 maize cDNA 8.1 (the 
sorghum DNA is SEQ No. 3 and the protein is SEQ ID NO. 4). 
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DISCLOSURE OF THE INVENTION 

All references referred to are incorporated herein by reference. 
Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
5 invention belongs. Unless mentioned otherwise, the techniques employed or contemplated 
therein are standard methodologies well known to one of ordinary skill in the art. The 
materials, methods and examples are illustrative only and not limiting. 

Genetic male sterility results from a mutation, suppression, or other impact to one 
of the genes critical to a specific step in microsporogenesis, the term applied to the entire 
10 process of pollen formulation. These genes can be collectively referred to as male fertility 
genes (or, alternatively, male sterility genes). There are many steps in the overall pathway 
O where gene function impacts fertility. This seems aptly supported by the frequency of 
Jj* genetic male sterility in maize. New alleles of male sterility mutants are uncovered in 
IM materials that range from elite inbreds to unadapted populations. To date, published 
jVjiS genetic male sterility research has been mostly descriptive. Some efforts have been made 
^ to establish the mechanism of sterility in maize, but few have been satisfactory. This 
H= should not be surprising given the number of genes that have been identified as being 
j T responsible for male sterility. One mechanism is unlikely to apply to all mutations. 
± At Patent No. 5,478,369 there is described a method by which a male sterility gene 

M20 was tagged on maize chromosome 9. Previously, the only described male sterility gene on 
chromosome 9 was MS2, which has never been cloned and sequenced. See Albertsen, M. 
and Phillips, R.L., "Developmental Cytology of 13 Genetic Male Sterile Loci in Maize" 
Canadian Journal of Genetics & Cytology 23:195-208 (Jan. 1981). The only fertility gene 
cloned before that had been the Arabadopsis gene described at Aarts, et al, supra. 
25 The SBMu200 gene described herein is located on maize chromosome 1 and its 

dominant allele is critical to male fertility. The locus map is represented at Figure 1. It 
can be used in the systems described above, and other systems impacting male fertility. 

The maize family cosegregating for sterility was named SBMu200 and was found 
to have an approximately 5.5 Kb EcoRI fragment that hybridized with a Mu8 probe. A 
30 genomic clone from the family was isolated which contained a Mu8 transposon. A probe 
made from DNA bordering the transposon was found to hybridize to the same ~5.5Kb 
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EcoRl fragment. This probe was used to isolate cDNA clones from a tassel cDNA library. 

The cDNA for SBMu200 is 1906 bp, and the Mu insertion occurred in exon 1 of the gene. 

Figure 9 (discussed further below) represents the genomic nucleotide sequence. 

Expression patterns, as determined by Northern analysis, show tassel specificity with peak 
5 expression at about the quartet to quartet release stages of microsporogenesis. 

Further, it will be evident to one skilled in the art that variations, mutations, 

derivations including fragments smaller than the entire sequence set forth may be used 

which retain the male sterility controlling properties of the gene. One of ordinary skill in 

the art can readily assess the variant or fragment by introduction into plants homozygous 
10 for a stable male sterile allele of MS26, followed by observation of the plant's male tissue 

development. 

p The invention also includes those nucleotide sequences which selectively hybridize 

!f: to the SBMu200 nucleotide sequences under stringent conditions. In referring to a 
M sequence that "selectively hybridizes" with SBMu200, the term includes reference to 
ml 5 hybridization, under stringent hybridization conditions, of a nucleic acid sequence to the 
" specified nucleic acid target sequence to a detectably greater degree (e.g., at least 2-fold 
H 2 over background) than its hybridization to non-target nucleic acid, 
y- The terms "stringent conditions" or "stringent hybridization conditions" includes 

f: reference to conditions under which a probe will hybridize to its target sequence, to a 
M20 detectably greater degree than to other sequences (e.g., at least 2-fold over background). 
Stringent conditions are target-sequence-dependent and will differ depending on the 
structure of the polynucleotide. By controlling the stringency of the hybridization and/or 
washing conditions, target sequences can be identified which are 100% complementary to 
a probe (homologous probing). Alternatively, stringency conditions can be adjusted to 
25 allow some mismatching in sequences so that lower degrees of similarity are detected 
(heterologous probing). Generally, probes of this type are in a range of about 1000 
nucleotides in length to about 250 nucleotides in length. 

An extensive guide to the hybridization of nucleic acids is found in Tijssen, 
Laboratory Techniques in Biochemistry and Molecular Biology-Hybridization with 
30 Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the 
strategy of nucleic acid probe assays", Elsevier, New York (1993); and Current Protocols 
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in Molecular Biology, Chapter 2, Ausubel, et al, Eds., Greene Publishing and Wiley- 
Interscience, New York (1995). See also Sambrook et al (1989) Molecular Cloning: A 
Laboratory Manual (2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.). 

In general, sequences that correspond to the nucleotide sequences of the present 
invention and hybridize to the nucleotide sequence disclosed herein will be at least 50% 
homologous, 70% homologous, and even 85% homologous or more with the disclosed 
sequence. That is, the sequence similarity between probe and target may range, sharing at 
least about 50%, about 70%, and even about 85% sequence similarity. 

Specificity is typically the function of post-hybridization washes, the critical 
factors being the ionic strength and temperature of the final wash solution. Generally, 
stringent wash temperature conditions are selected to be about 5°C to about 2°C lower than 
the melting point (Tm) for the specific sequence at a defined ionic strength and pH. The 
melting point, or denaturation, of DNA occurs over a narrow temperature range and 
represents the disruption of the double helix into its complementary single strands. The 
process is described by the temperature of the midpoint of transition, Tm, which is also 
called the melting temperature. Formulas are available in the art for the determination of 
melting temperatures. 

Preferred hybridization conditions for the nucleotide sequence of the invention 
include hybridization at 42°C in 50%(w/v) formamide, 6X SSC, 0.5%(w/v) SDS, 100(g/ml 
salmon sperm DNA. Exemplary low stringency washing conditions include hybridization 
at 42°C in a solution of 2X SSC, 0.5% (w/v) SDS for 30 minutes and repeating. 
Exemplary moderate stringency conditions include a wash in 2X SSC, 0.5% (w/v) SDS at 
50°C for 30 minutes and repeating. Exemplary high stringency conditions include a wash 
in 2X SSC, 0.5% (w/v) SDS, at 65°C for 30 minutes and repeating. Sequences that 
correspond to the promoter of the present invention may be obtained using all the above 
conditions. For purposes of defining the invention, the high stringency conditions are 
used. 

Methods of aligning sequences for comparison are well-known in the art. Gene 
comparisons can be determined by conducting BLAST ( Basic Local Alignment Search 
Tool; Altschul, S. F„ et al, (1993) J. Mol. Biol 215:403-410; see also 
www.ncbi.nlm.nih.gov/BLAST/) searches under default parameters for identity to 
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sequences contained in the BLAST "GENEMBL" database. A sequence can be analyzed 
for identity to all publicly available DNA sequences contained in the GENEMBL database 
using the BLASTN algorithm under the default parameters. Identity to the sequence of the 
present invention would mean a polynucleotide sequence having at least 65% sequence 
5 identity, more preferably at least 70% sequence identity, more preferably at least 75% 
sequence identity, more preferably at least 80% identity, more preferably at least 85% 
sequence identity, more preferably at least 90% sequence identity and most preferably at 
least 95% sequence identity. 

Promoter regions can be readily identified by one skilled in the art. The putative 
10 start codon containing the ATG motif is identified and upstream from the start codon is the 
presumptive promoter. By "promoter" is intended a regulatory region of DNA usually 
P comprising a TATA box capable of directing RNA polymerase II to initiate RNA synthesis 
y at the appropriate transcription initiation site for a particular coding sequence. A promoter 
M* can additionally comprise other recognition sequences generally positioned upstream or 5' 
( n 15 to the TATA box, referred to as upstream promoter elements, which influence the 
% i transcription initiation rate. It is recognized that having identified the nucleotide 
y* sequences for the promoter region disclosed herein, it is within the state of the art to isolate 
V: and identify further regulatory elements in the region upstream fb the TATA box from the 
»P particular promoter region identified herein. Thus the promoter region disclosed herein is 
-5=20 generally further defined by comprising upstream regulatory elements such as those 

responsible for tissue and temporal expression of the coding sequence, enhancers and the 
like. In the same manner, the promoter elements which enable expression in the desired 
tissue such as male tissue can be identified, isolated, and used with other core promoters to 
confirm male tissue-preferred expression. 
25 The isolated promoter sequence of the present invention can be modified to provide 

for a range of expression levels of the heterologous nucleotide sequence. Less than the 
entire promoter region can be utilized and the ability to drive anther-preferred expression 
retained. However, it is recognized that expression levels of mRNA can be decreased with 
deletions of portions of the promoter sequence. Thus, the promoter can be modified to be 
30 a weak or strong promoter. Generally, by "weak promoter 11 is intended a promoter that 
drives expression of a coding sequence at a low level. By "low level" is intended levels of 
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about 1/10,000 transcripts to about 1/100,000 transcripts to about 1/500,000 transcripts. 
Conversely, a strong promoter drives expression of a coding sequence at a high level, or at 
about 1/10 transcripts to about 1/100 transcripts to about 1/1,000 transcripts. Generally, at 
least about 30 nucleotides of an isolated promoter sequence will be used to drive 

5 expression of a nucleotide sequence. It is recognized that to increase transcription levels, 
enhancers can be utilized in combination with the promoter regions of the invention. 
Enhancers are nucleotide sequences that act to increase the expression of a promoter 
region. Enhancers are known in the art and include the SV40 enhancer region, the 35S 
enhancer element, and the like. 

0 Sequences which hybridize to the sequences of the present invention are within the 

scope of the invention. Sequences that correspond to the promoter sequences of the 
present invention and hybridize to the promoter sequences disclosed herein will be at least 
50% homologous, 70% homologous, and even 85% homologous or more with the 
disclosed sequence. 

15 Smaller fragments may yet contain the regulatory properties of the promoter so 

identified and deletion analysis is one method of identifying essential regions. Deletion 
analysis can occur from both the 5' and 3' ends of the regulatory region. Fragments can be 
obtained by site-directed mutagenesis, mutagenesis using the polymerase chain reaction 
and the like. (See, Directed Mutagenesis: A Practical Approach IRL Press (1991)). The 

20 3' deletions can delineate the essential region and identify the 3' end so that this region 
may then be operably linked to a core promoter of choice. Once the essential region is 
identified, transcription of an exogenous gene may be controlled by the essential region 
plus a core promoter. The core promoter can be any one of known core promoters such as 
the Cauliflower Mosaic Virus 35S or 19S promoter (U.S. Patent No. 5,352,605), ubiquitin 

25 promoter (U.S. Patent No. 5,5 10,474) the IN2 core promoter (U.S. Patent No. 5,364,780) 
or a Figwort Mosaic Virus promoter (Gruber, et al. "Vectors for Plant Transformation" 
Methods in Plant Molecular Biology and Biotechnolog y ) et al. eds, CRC Press pp.89- 1 19 
(1993)). 

The regulatory region of SBMU200 has been identified as including the 1005 bp 
30 region upstream of the putative TATA box. See Figure 8. Further, using the procedures 
outlined above, it has been determined that an essential region of the promoter includes the 
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-180 bp upstream of the TATA box and specifically, the -176 to -44 region is particularly 
essential. 

Promoter sequences from other plants may be isolated according to well- known 
techniques based on their sequence homology to the promoter sequence set forth herein. 
5 In these techniques, all or part of the known promoter sequence is used as a probe which 
selectively hybridizes to other sequences present in a population of cloned genomic DNA 
fragments (i.e. genomic libraries) from a chosen organism. Methods are readily available 
in the art for the hybridization of nucleic acid sequences. 

The entire promoter sequence or portions thereof can be used as a probe capable of 
10 specifically hybridizing to corresponding promoter sequences. To achieve specific 

hybridization under a variety of conditions, such probes include sequences that are unique 
H and are preferably at least about 10 nucleotides in length, and most preferably at least 
St about 20 nucleotides in length. Such probes can be used to amplify corresponding 
I s * promoter sequences from a chosen organism by the well-known process of polymerase 
ill 15 chain reaction (PCR). This technique can be used to isolate additional promoter sequences 
" : from a desired organism or as a diagnostic assay to determine the presence of the promoter 
M- sequence in an organism. Examples include hybridization screening of plated DNA 
y k libraries (either plaques or colonies; see e.g. Innis et al., eds., (1990) PCR Protocols, A 
j~ Guide to Methods and Applications . Academic Press). 

M20 Further, the promoter of the present invention can be linked with nucleotide 

sequences other than the SBMu200 gene to express other heterologous nucleotide 
sequences. The nucleotide sequence for the promoter of the invention, as well as fragments 
and variants thereof, can be provided in expression cassettes along with heterologous 
nucleotide sequences for expression in the plant of interest, more particularly in the male 
25 tissue of the plant. Such an expression cassette is provided with a plurality of restriction 
sites for insertion of the nucleotide sequence to be under the transcriptional regulation of 
the promoter. These expression cassettes are useful in the genetic manipulation of any 
plant to achieve a desired phenotypic response. Examples of other nucleotide sequences 
which can be used as the exogenous gene of the expression vector with the SBMu200 
30 promoter include complementary nucleotidic units such as antisense molecules (callase 
antisense RNA, barnase antisense RNA and chalcone synthase antisense RNA, Ms45 
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antisense RNA), ribozymes and external guide sequences, an aptamer or single stranded 
nucleotides. The exogenous nucleotide sequence can also encode auxins, rol B, cytotoxins, 
diptheria toxin, DAM methylase, avidin, or may be selected from a prokaryotic regulatory 
system. By way of example, Mariani, et al., Nature; Vol. 347; pp. 737; (1990), have shown 
that expression in the tapetum of either Aspergillus oryzae RNase-Tl or an RNase of Bacillus 
amyloliquefaciens, designated "barnase," induced destruction of the tapetal cells, resulting in 
male infertility. Quaas, et al., Eur. J. Biochem . Vol 173: pp. 617 (1988), describe the chemical 
synthesis of the RNase-Tl, while the nucleotide sequence of the barnase gene is disclosed in 
Hartley, J. Molec. Biol : Vol. 202: pp. 913 (1988). The rolB gene of Agrobacterium 
rhizogenes codes for an enzyme that interferes with auxin metabolism by catalyzing the 
release of free indoles from indoxyl-B-glucosides. Estruch, et al, EMBO J. Vol. 1 1: pp. 3125 
(1991) and Spena, et al, Theor. Appl. Genet.: Vol. 84: pp. 520 (1992), have shown that the 
anther-specific expression of the rolB gene in tobacco resulted in plants having shriveled 
anthers in which pollen production was severely decreased and the rolB gene is an example of 
a gene that is useful for the control of pollen production. Slightom, et al., J. Biol. Chem . Vol. 
261: pp. 108 (1985), disclose the nucleotide sequence of the rolB gene. DNA molecules 
encoding the diphtheria toxin gene can be obtained from the American Type Culture 
Collection (Rockville, MD), ATCC No. 39359 or ATCC No. 6701 1 and see Fabijanski, et al, 
E.P. Appl. No. 90902754.2 , "Molecular Methods of Hybrid Seed Production" for examples 
and methods of use. The DAM methylase gene is used to cause sterility in the methods 
discussed at U.S. Patent No. 5,689,049 and PCT/US95/ 15229 Cigan, A.M. and Albertsen, 
M.C., "Reversible Nuclear Genetic System for Male Sterility in Transgenic Plants". Also see 
discussion of use of the avidin gene to cause sterility at U.S. Patent No. 5,962,769 "Induction 
of Male Sterility in Plants by Expression of High Levels of Avidin" by Albertsen et al. 

The invention includes vectors with the SBMu200 gene. A vector is prepared 
comprising the SBMu200 gene, a promoter that will drive expression of the gene in the 
plant and a terminator region. As noted, the promoter in the construct may be the native 
promoter or a substituted promoter which will provide expression in the plant. Selection 
of the promoter will depend upon the use intended of the gene. The promoter in the 
construct may be an inducible promoter, so that expression of the sense or antisense 
molecule in the construct can be controlled by exposure to the inducer. 
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Other components of the vector may be included, also depending upon intended 
use of the gene. Examples include selectable markers, targeting or regulatory sequences, 
stabilizing or leader sequences, etc.. General descriptions and examples of plant 
expression vectors and reporter genes can be found in Gruber, et al., "Vectors for Plant 
5 Transformation" in Method in Plant Molecular B iolopy and Biotechnology . Glick et al 
eds;CRC Press pp. 89-1 19 (1993). The selection of an appropriate expression vector will 
depend upon the host and the method of introducing the expression vector into the host. 
The expression cassette will also include at the 3' terminus of the heterologous nucleotide 
sequence of interest, a transcriptional and translational termination region functional in 
1 0 plants. The termination region can be native with the promoter nucleotide sequence of the 
present invention, can be native with the DNA sequence of interest, or can be derived from 
O another source. Convenient termination regions are available from the Ti-plasmid of A. 
Sy tumefaciens, such as the octopine synthase and nopaline synthase termination regions. See 
jj also, Guerineau et al. Mol. Gen. Genet. 262:141-144 (1991); Proudfoot £§JI_64: 67 1-674 
ms (1991); Sanfacon et al. Genes Dev. 5:141-149 (1991); Mogen et al. Plant Cell 2:1261-1272 
s " (1990); Munroe et al. fiene_91:15 1-158 (1990); Ballas et al. Nucleic Acids Res. 17:7891- 
N= 7903 (1989); Joshi et al. Nucleic Acid Res. 15:9627-9639 (1987). 
y> The expression cassettes can additionally contain 5' leader sequences. Such leader 

p: sequences can act to enhance translation. Translation leaders are known in the art and 
H20 include: picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5' 

noncoding region), Elroy-Stein et al. Proc. Nat. Acad. Sci. USA 86:6126-6130 (1989); 
potyvirus leaders, for example, TEV leader (Tobacco Etch Virus), Allison et al.; MDMV 
leader (Maize Dwarf Mosaic Virus), Virolog y 154:9-20 (1986); human immunoglobulin 
heavy-chain binding protein (BiP), Macejak et al. N_a!ure_353:90-94 (1991); untranslated 
25 leader from the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4), Jobling et al. 
Nature 325:622-625 (1987); Tobacco mosaic virus leader (TMV), Gallie et al. (1989) 
Molecular Biology of RNA . pages 237-256; and maize chlorotic mottle virus leader 
(MCMV) Lommel et al. Virolog y 81:382-385 (1991). See also Della-Cioppa et al. Plant 
Physiolog y 84:965-968 (1987). The cassette can also contain sequences that enhance 
30 translation and/or mRNA stability such as introns. 
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In those instances where it is desirable to have the expressed product of the 
heterologous nucleotide sequence directed to a particular organelle, particularly the plastid, 
amyloplast, or to the endoplasmic reticulum, or secreted at the cell's surface or 
extracellularly, the expression cassette can further comprise a coding sequence for a transit 
peptide. Such transit peptides are well known in the art and include, but are not limited to, 
the transit peptide for the acyl carrier protein, the small subunit of RUBISCO, plant EPSP 
synthase, and the like. One skilled in the art will readily appreciate the many options 
available in expressing a product to a particular organelle. For example, the barley alpha 
amylase sequence is often used to direct expression to the endoplasmic reticulum (Rogers, 
J. Biol. Chem. 260:373 1-3738 (1985)). Use of transit peptides is well known (e.g., see 
U.S. Patents Nos. 5,717,084; 5,728,925). 

In preparing the expression cassette, the various DNA fragments can be 
manipulated, so as to provide for the DNA sequences in the proper orientation and, as 
appropriate, in the proper reading frame. Toward this end, adapters or linkers can be 
employed to join the DNA fragments or other manipulations can be involved to provide 
for convenient restriction sites, removal of superfluous DNA, removal of restriction sites, 
or the like. For this purpose, in vitro mutagenesis, primer repair, restriction digests, 
annealing, and resubstitutions, such as transitions and transversions, can be involved. 

As noted herein, the present invention provides vectors capable of expressing genes 
of interest under the control of the promoter. In general, the vectors should be functional 
in plant cells. At times, it may be preferable to have vectors that are functional in £. coli 
(e.g., production of protein for raising antibodies, DNA sequence analysis, construction of 
inserts, obtaining quantities of nucleic acids). Vectors and procedures for cloning and 
expression in E. coli are discussed in Sambrook et al. (supra). 

The transformation vector comprising the promoter sequence of the present 
invention operably linked to a heterologous nucleotide sequence in an expression cassette, 
can also contain at least one additional nucleotide sequence for a gene to be cotransformed 
into the organism. Alternatively, the additional sequence(s) can be provided on another 
transformation vector. 

Reporter genes can be included in the transformation vectors. Examples of suitable 
reporter genes known in the art can be found in, for example, Jefferson et al. (1991) in 
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Plant Molecular Biology Manual ed. Gelvin et al. (Kluwer Academic Publishers), pp. 1- 
33; DeWet et al. Mol. Cell. Biol . 7:725-737 (1987); Goff et al. EMBO J. 9:2517-2522 
(1990); Kain et al. BipTrehnigres 19:650-655 (1995); and Chiu et al. Current Biolog y 
6:325-330(1996). 

5 Selectable marker genes for selection of transformed cells or tissues can be 

included in the transformation vectors. These can include genes that confer antibiotic 
resistance or resistance to herbicides. Examples of suitable selectable marker genes 
include, but are not limited to, genes encoding resistance to chloramphenicol, Herrera 
Estrella et al. EMBO J. 2:987-992(1983); methotrexate, Herrera Estrella et al. Nature 
303:209-213(1983); Meijer et al. Plant Mol. Biol . 16:807-820 (1991); hygromycin, 
Waldron et al. Plant Mol. Biol . 5:103-108 (1985); Zhijian et al. Plant Science 108:219-227 
(1995); streptomycin, Jones et a l Mol. Gen. Genet 210:86-91(1987): spectinomycin, 
Bretagne-Sagnard et al Transgenic Res . 5:131-137 (1996); bleomycin, Hille et al. Plant 
Mol. Biol. 7:171-176 (1990); sulfonamide, Guerineau et al. Plant Mol. Biol . 15:127- 
136(1990); bromoxynil, Stalker et al. Sokn££.242:4 19-423 (1988); glyphosate, Shaw et al. 
Science 233:478-481(1986); phosphinothricin, DeBlock et al EMBO J . 6:2513-2518 
(1987). 

The method of transformation/transfection is not critical to the instant invention; 
various methods of transformation or transfection are currently available. As newer 
methods are available to transform crops or other host cells they may be directly applied. 
Accordingly, a wide variety of methods have been developed to insert a DNA sequence 
into the genome of a host cell to obtain the transcription or transcript and translation of the 
sequence to effect phenotypic changes in the organism. Thus, any method which provides 
for efficient transformation/transfection may be employed. 
25 Methods for introducing expression vectors into plant tissue available to one skilled 

in the art are varied and will depend on the plant selected. Procedures for transforming a 
wide variety of plant species are well known and described throughout the literature. See, 
for example, Miki et al, "Procedures for Introducing Foreign DNA into Plants" in 
Methods in Plant Molecular Biotechnology, supra; Klein et al, giQ/TechnplQgy 10:268 
30 (1992); and Weising et al, Ann. Rev. Genet 22: 421-477 (1988). For example, the DNA 
construct may be introduced into the genomic DNA of the plant cell using techniques such 
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as microprojectile-mediated delivery, Klein et aL, Nature 327: 70-73 (1987); 
electroporation, Fromm et aL Proc. Natl Acad. Sci. 82:5824(1985); polyethylene 
glycol (PEG) precipitation, Paszkowski et aL, EMBO J, 3: 2717-2722 (1984); direct gene 
transfer WO 85/01856 and EP No. 0 275 069; in vitro protoplast transformation U.S. 
5 patent no. 4,684,61 1; and microinjection of plant cell protoplasts or embryogenic callus. 
Crossway, Mol. Gen. Genetics 202:179-185 (1985). Co-cultivation of plant tissue with 
Agrobacterium tumefaciens is another option, where the DNA constructs are placed into a 
binary vector system. See e.g., U.S. Patent No. 5,591,616; Ishida et aL, "High Efficiency 
Transformation of Maize (Zea mays L.) mediated by Agrobacterium tumefaciens" Nature 
10 Biotechnolog y 14:745-750 (1996). The virulence functions of the Agrobacterium 

tumefaciens host will direct the insertion of the construct into the plant cell DNA when the 

h cell is infected by the bacteria. See, for example Horsch et aL, Science 233: 496-498 

g (1984), and Fraley et aL, Proc . Natl. Acad. Sci . 80: 4803 (1983). 

H 5 Standard methods for transformation of canola are described at Moloney et aL 

r S5? 

yr|15 "High Efficiency Transformation of Brassica napus using Agrobacterium Vectors" Plant 

Cell Reports 8:238-242 (1989). Corn transformation is described by Fromm et al, 
H= Bio/Technolog y 8:833 (1990) and Gordon-Kamm et al, supra, Agrobacterium is 
\2 primarily used in dicots, but certain monocots such as maize can be transformed by 
£ Agrobacterium. See supra and U.S. Patent No. 5,550,3 18. Rice transformation is 
M20 described by Hiei et aL, "Efficient Transformation of Rice (Oryza sativs L.) Mediated by 
Agrobacterium and Sequence Analysis of the Boundaries of the T-DNA" The Plant 
Journal 6(2): 271-282 (1994, Christou et al, Trends in Biotechnology 10:239 (1992) and 
Lee et al, Proc. Nat'l Acad. Sci. USA 88:6389 (1991). Wheat can be transformed by 
techniques similar to those used for transforming corn or rice. Sorghum transformation is 
25 described at Casas et al, supra and sorghum by Wan et al, Plant PhysicoL 104:37 (1994). 
Soybean transformation is described in a number of publications, including U.S. Patent no. 
5,015,580. 

Further detailed description is provided below by way of instruction and illustration 
and is not intended to limit the scope of the invention. 
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Example 1 

Identification and Cosegregation of SBMu200 
Families of plants from a mutator (Mu) population were identified that segregated 
for plants that were mostly male sterile, with none or only a few extruded abnormal 
5 anthers, none of which had pollen present. Male sterility is expected to result from those 
instances where a Mu element has randomly integrated into a gene responsible for some 
step in microsporogenesis, disrupting its expression. Plants from a segregating F 2 family 
in which the male sterile mutation was designated SBMu200, were grown and classified 
for male fertility/sterility based on the above criteria. Leaf samples were taken and DNA 
10 subsequently isolated on approximately 20 plants per phenotypic classification, that is 
male fertility vs. male sterility, 
p Southern analysis was performed to confirm association of Mu with sterility. 

"J Southern analysis is a well known technique to those skilled in the art. This common 
H procedure involves isolating the plant DNA, cutting with restriction endonucleases, 
HI 15 fractioning the cut DNA by molecular weight on an agarose gel, and transferring to nylon 

membranes to fix the separated DNA. These membranes are subsequently hybridized with 
lf_ a probe fragment that was radioactively labeled with P 32 P-dCTP, and washed in an SDS 
U solution. Southern, E., "Detection of Specific Sequences Among DNA Fragments by Gel 
% Electrophoresis," J. Mol. Biol . 98:503-3 17(1 975). Plants from a segregating F2 
H20 SBMu200 family were grown and classified for male fertility/sterility. Leaf samples and 
subsequent DNA isolation was conducted on approximately 20 plants per phenotypic 
classification. DNA (~7ug) from 5 fertile and 12 sterile plants was digested with EcoRI 
and electrophoresed through a 0.75% agarose gel. The digested DNA was transferred to 
nylon membrane via Southern transfer. The membrane was hybridized with an internal 
25 fragment from the Mu8 transposon. Autoradiography of the membrane revealed 

cosegregation of a 5.5 Kb EcoRI fragment with the sterility phenotype as shown in Figure 
2. This EcoRI band segregated in the fertile plants suggesting a heterozygous wild type 
condition for the allele. 
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Example 2 
Library Construction and Screening 
The process of cDN A library screenings is commonly known among those skilled 
in the art and is described at Sambrook, J., Fritsch, E.F., Maniatis T. 5 et al., Molecular 
5 Cloning: A Laboratory Manual Cold Spring Harbor Laboratory, Cold Spring Harbor Lab 
Press, Plainview, NY (1989). Libraries were created as follows. 

DNA from a sterile plant was digested with EcoRI and run on a preparative gel. 
DNA with a molecular weight between 5.0 and 6.0 Kb was excised from the gel, 
electroeluted and ethanol precipitated. This DNA was ligated into the Lambda Zap vector 
10 (Stratagene™) using the manufacturer's protocol. The ligated DNA was packaged into 
phage particles using Gigapack Gold (Stratagene™). Approximately 500,000 PFU were 
q plated and lifted onto nitrocellulose membranes. Membranes were hybridized with the 
M Mu8 probe. A pure clone was obtained after 3 rounds of screening. The insert was 
H excised from the phage as a plasmid and designated SBMu200-3. 1 . A PstI border 
yjs fragment from this clone was isolated and used to reprobe the orginal EcoRI cosegregation 
^ blot. The 5.5 Kb EcoRI fragment is homozygous in all the sterile plants, which confirms 
U that the correct Mu fragment was isolated. Three of the fertile plants are heterozygous for 
r: the 5.5 Kb EcoRI band and a 4.3 Kb EcoRI band. Two of the fertile plants are 

± homozygous for the 4.3 Kb EcoRI band, presumably the wild type allele. 

O 

L20 Example 3 

Expression Analysis and cDNA Isolation 
Northern analysis can be used to detect expression of genes characteristic of anther 
development at various states of microsporogenesis. Northern analysis is also a commonly 
used technique known to those skilled in the art and is similar to Southern analysis except 
25 that mRNA rather than DNA is isolated and placed on the gel. The RNA is then hybridzed 
with the labeled probe. Potter, E., et al., "Thyrotrotropin Releasing Hormone Exerts Rapid 
Nuclear Effects to Increase Production of the Primary Prolactin in RNA Transcript," Proc. 
Nat. Acad. Sci. USA 78:6662-6666 (1981), Lechelt, et al., "Isolation & Molecular 
Analysis of the Plows," Mol.Gen.Genet. 219:225-234 (1989). The PstI fragment from the 
30 SBMu200-3. 1 clone was used to probe a Northern blot containing kernel, immature ear, 
seedling and tassel RNA. A signal was seen only in tassel RNA at approximately the 
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quartet stage of microsporogenesis, as reflected in Figure 3. The transcript is about 2.3 KB 
in length. The same probe was also used to screen a cDNA library constructed from 
mRNA isolated from meiotic to late uninucleate staged anthers. One clone, designated 
SBMu200-8.1, was isolated from the library. 

Example 4 
Sggygngg Analysis 

The SBMu200-3.1 genomic clone and the cDNA clone, SBMu200-8.1, were 
sequenced by Loftstrand Labs Limited. Sanger, F., Nicklen, S., Coulson A.R. (1977) 
"DNA sequencing with chain terminating inhibitors" Proc. Natl Acad. Sci, USA 74:5463- 
5467. The sequences are set forth in Figure 4 and 5 and the comparison is at Figure 6. 
The cDNA/genomic comparison reveals five introns are present in the genomic clone. The 
Mu8 insertion occurs in exon 1. Testing for codon preference and non-randomness in the 
third position of each codon was consistent with the major ORF in the cDNA being the 
likely protein-coding ORF. There is a putative Met start codon at position 1089 in the 
genomic clone. The cDNA homology with respect to the genomic clone begins at 
nucleotide 1094. Thus SBMu200-8.1 does not represent a full length clone and lacks 5 
bases up to the putative Met start codon. A database search revealed significant homology 
to P450 enzymes found in yeast, plants and mammals. P450 enzymes have been widely 
studied and three characteristic protein domains have been elucidated. A comparison of 
the predicted protein from SBMu200-8.1 to the consensus amino acid sequence of these 
domains showed 1) 92% identity to the dioxygen binding domain, 2) 85% identity to the 
tridecapeptide domain (steroid binding), and 3) 100% identity to the C-terminal heme 
attachment domain. Further expression studies were done using the SBMu200-8.1 cDNA 
probe against a northern containing mRNA at discrete stages of microsporogenesis. Signal 
is detected from meiosis II/quartet to late-uninucleate, with maximal signal being observed 
from early-uninucleate through late-uninucleate as shown at Figure 7. 

Example 5 

Identification of Promoter and its Essential Regions 
A putative TATA box can be identified by primer extension analysis as described 
in by Current Protocols in Molecular Biology . Ausubel, F.M. et al. eds; John Wiley and 
Sons, New York pp.4.8.1 - 4.8.5 (1987). 
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Regulatory regions of anther genes, such as promoters, may be identified in 
genomic subclones using functional analysis, usually verified by the observation of 
reporter gene expression in anther tissue and a lower level or absence of reporter gene 
expression in non-anther tissue. The possibility of the regulatory regions residing 
5 "upstream" or 5' ward of the translational start site can be tested by subcloning a DNA 
fragment that contains the upstream region into expression vectors for transient expression 
experiments. It is expected that smaller subgenomic fragments may contain the regions 
essential for male-tissue preferred expression. For example, the essential regions of the 
CaMV 19S and 35S promoters have been identified in relatively small fragments derived 
10 from larger genomic pieces as described in U.S. Pat No. 5,352,605. 

The selection of an appropriate expression vector with which to test for functional 
expression will depend upon the host and the method of introducing the expression vector 
□ into the host and such methods are well known to one skilled in the art. For eukaryotes, 
the regions in the vector include regions that control initiation of transcription and control 
5 15 processing. These regions are operably linked to a reporter gene such as UidA, encoding p 

s ;""a 

\| -glucuronidase (GUS), or luciferase. General descriptions and examples of plant 
J\. expression vectors and reporter genes can be found in Gruber, et al., "Vectors for Plant 
Hi Transformation" in Methods in Plfrnt Molester Biolo g y ?n j BipteghnQlQgy; Glick, et al. 
£ eds; CRC Press; pp. 89-1 19; (1993). GUS expression vectors and GUS gene cassettes are 
H20 commercially available from Clonetech, Palo Alto, CA , while luciferase expression 
vectors and luciferase gene cassettes are available from Promega Corporation, Madison, 
WI. Ti plasmids and other Agrobacterium vectors are described in Ishida, Y., et al., 
Nature Biotechnology : Vol. 14; pp. 745-750; (1996) and in U.S. Pat. No. 5,591,616 
"Method for Transforming Monocotyledons" (1994). 
25 Expression vectors containing putative regulatory regions located in genomic 

fragments can be introduced into intact tissues such as staged anthers, embryos or into 
callus. Methods of DNA delivery include microprojectile bombardment, DNA injection, 
electroporation and Agrobacterium-mediated gene transfer (see Gruber, et al., "Vectors for 
Plant Transformation," in Methods in Plant Molecular Biology a nd Biotechnology. Glick, et 
30 al. eds.; CRC Press; (1993); U.S Pat. No. 5,591,616; and Ishida, Y., et al., Nainffi 



20 



ffi 



Biotechnology : Vol. 14; pp. 745-750; (1996)). General methods of culturing plant tissues 
are found in Gruber, et aL, supra and Click, supra. 

For the transient assay system, staged, isolated anthers are immediately placed onto 
tassel culture medium (Pareddy, D.R. and J.F. Petelino, Crop Sci. J .: Vol. 29; pp. 1564- 
5 1 566; (1989)) solidified with 0.5% Phytagel (Sigma, St. Louis) or other solidifying media. 
The expression vector DNA is introduced within 5 hours preferably by microprojectile- 
mediated delivery with 1.2 \xm particles at 1000 -1 100 Psi. After DNA delivery, the 
anthers are incubated at 26°C upon the same tassel culture medium for 17 hours and 
analyzed by preparing a whole tissue homogenate and assaying for GUS or for lucifierase 
10 activity (see Gruber, et al., supra). 

Upstream of the likely translational start codon of SBMu200, 1088 bp of DNA was 
present in the genomic clone SBMu200-3.1. Translational fusions via an engineered Ncol 
site were generated with reporter genes encoding luciferase and p-glucuronidase to test 
whether this fragment of DNA had promoter activity in transient expression assays of 
U! 15 bombarded plant tissues. Activity was demonstrated in anthers and not in coleoptiles, roots 
and calli, suggesting anther-preferred or anther-specific promoter activity. 

jf s A reasonable TATA box was observed by inspection, about 83-77 bp upstream of 

\ y 

the translational start codon. The genomic clone SBMu200-3.1 thus includes about 1005 

ass 

% bp upstream of the possible TATA box. For typical plant genes, the start of transcription is 
H20 26-36 bp downstream of the TATA box, which would give the SBMu200 mRNA a 5'- 
nontranslated leader of about 48-58 nt. The total SBMu200 subgenomic fragment of 1088 
bp, including nontranslated leader, start of transcription, TATA box and sequences 
upstream of the TATA box, was thus shown to be sufficient for promoter activity. See 
Figure 8, which is SEQ. ID NO.5. The putative TATA box (TATATCA) is underlined. 
25 Thus, the present invention encompasses a DNA molecule having a nucleotide sequence of 
SEQ ID NO: 5 (or those with sequence identity) and having the function of a male tissue- 
preferred regulatory region. 

Deletion analysis can occur from both the 5' and 3' ends of the regulatory region: 
fragments can be obtained by site-directed mutagenesis, mutagenesis using the polymerase 
30 chain reaction, and the like ( Directed Mu tagenesis: A Practical Approach: IRL Press; 
(1991)). The 3' end of the male tissue-preferred regulatory region can be delineated by 
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proximity to the putative TATA box or by 3' deletions if necessary. The essential region 
may then be operably linked to a core promoter of choice. Once the essential region is 
identified, transcription of an exogenous gene may be controlled by the male tissue-preferred 
region of SBMu200 plus a core promoter. The core promoter can be any one of known core 
5 promoters such as a Cauliflower Mosaic Virus 35S or 19S promoter (U.S. Pat No. 5,352,605), 
Ubiquitin (U.S. Pat. No. 5,510,474), the IN2 core promoter (U.S. Pat No. 5,364,780), or a 
Figwort Mosaic Virus promoter (Gruber, et at, "Vectors for Plant Transformation" in 
Methods in Plant Molecular Biology and Biotechnology ; Glick, et at eds.; CRC Press; pp. 
89-1 19; (1993)). Preferably, the promoter is the core promoter of a male tissue-preferred gene 
10 or the CaMV 35S core promoter. More preferably, the promoter is a promoter of a male 
tissue-preferred gene and in particular, the SBMu200 core promoter. 

Further mutational analysis, for example by linker scanning, a method well known to 
Q the art, can identify small segments containing sequences required for anther-preferred 
[T expression. These mutations may introduce modifications of functionality such as in the 
0 1 15 levels of expression, in the timing of expression, or in the tissue of expression. Mutations may 

in 

Sj also be silent and have no observable effect. 

J , The foregoing procedures were used to identify essential regions of the SBMu200 

fU promoter. After linking the promoter with the luciferase marker gene deletion analysis 
£ was performed on the regions of the promoter upstream of the putative TATA box, as 
f:20 represented in Figure 9. The x-axis of the bar graph indicates the number of base pairs 

immediately upstream of the putative TATA box retained in a series of deletion derivatives 
starting from the 5 5 end of the promoter. The y-axis shows the normalized luciferase 
activity as a percent of full-length promoter activity. 

As is evident from the graph, approximately 176 bp immediately upstream of the 
25 TATA box was sufficient, when coupled to the core promoter (putative TATA box through 
start of transcription), plus 5' nontranslated leader, for transient expression in anthers. By 
contrast, luciferase activity was minimal upon further deletion from the 5' end to 91 bp 
upstream of the putative TATA box. This 176 bp upstream of the putative TATA box 
through the nontranslated leader can be cosnidered a minimal promoter, which is further 
30 represented at Figure 10. The TATA box is underlined. Deletion within the full-length 
promoter from -176 through -92 relative to the TATA box reduced activity to about 1% of 
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wild type. Deletion of -39 through -8 did not greatly reduce activity. Therefore the -176 
to -44bp region contains an essential region and thus would constitute an upstream 
enhancer element conferring anther expression on the promoter, which we refer to as an 
"anther box". 

Linker scanning analysis was conducted across the anther box in 9-10 bp 
increments. The locations of the linker scanning substitutions in this region are shown in 
Figure 10, and the expression levels of the mutants relative to the wild type sequence are 
shown in Figure 1 1 . The most drastic effect on transient expression in anthers was 
observed for mutants LS 12 and LSI 3, in the region 52-71 bp upstream of the putative 
TATA box. A major effect on transient expression in anthers was also observed for 
mutants LS06, LS07, LS08 and LS10, within the region 82-131 bp upstream of the 
putative TATA box. Sequences within the anther box required for wild type levels of 
transient expression in anthers are thus demonstrated in the -52 to -131 region relative to 
the putative TAATA box, particularly the -52 to -71 region. 

Example 6 

SBMu200 Sorghum Tassel RT-PCR and SBMu200 Maize cDNA Comparison 
As noted above, SBMu200 is a male fertility gene. When it is mutated, male 
sterility will result. A homologue of SBMu200 was identified in sorghum. The sorghum- 
SBMu200 cDNA was isolated by using the maize SBMu200 gene primers in a polymerase 
chain reaction with sorghum tassel cDNA as the template. The resultant cDNA fragment 
was sequences by methods described supra and then compared to the SBMu200 cDNA 
from maize. Nucleotide sequence comparisons are set forth in Figure 10 and show 90% 
identity. 

As is evident from the above, the SBMu200 gene is critical to male fertility in 

plants. 

Thus it can be seen that the invention achieves at least all of its objectives. 
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